Improved Interpretability of the Unified Distance Matrix with Connected Components

نویسندگان

  • Lutz Hamel
  • Chris W. Brown
چکیده

Self-organizing maps have been adopted in many fields as the data visualization method of choice. The unified distance matrix is the de facto standard for evaluating and interpreting self-organizing maps. In large, high-dimensional problems clusters can be difficult to identify in the plain unified distance matrix. Here we introduce an enhanced version of the unified distance matrix in which clusters are easier to see and interpret. In this enhanced version we view the self-organizing map as a planar graph where the clusters are connected components of this graph. Using the transitive properties of connectedness and exploiting the fact that each component has a minimal node where the gradient on the unified distance matrix is equal to zero we can transform these connected components into stars with the minimal node as the internal node. In order to avoid unnecessary fragmentation of the components we apply a kernel based smoothing algorithm to the unified distance matrix. Our enhanced unified distance matrix is then the smoothed original unified distance matrix with the star components overlaid. The result is an easily interpretable selforganizing map. We perform a number of experiments on synthetic as well as real-world data that highlight the increased visual power of this enhanced unified distance matrix. Fig. 1. A unified distance matrix.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the spectra of reduced distance matrix of the generalized Bethe trees

Let G be a simple connected graph and {v_1,v_2,..., v_k} be the set of pendent (vertices of degree one) vertices of G. The reduced distance matrix of G is a square matrix whose (i,j)-entry is the topological distance between v_i and v_j of G. In this paper, we compute the spectrum of the reduced distance matrix of the generalized Bethe trees.

متن کامل

Some results on the energy of the minimum dominating distance signless Laplacian matrix assigned to graphs

Let G be a simple connected graph. The transmission of any vertex v of a graph G is defined as the sum of distances of a vertex v from all other vertices in a graph G. Then the distance signless Laplacian matrix of G is defined as D^{Q}(G)=D(G)+Tr(G), where D(G) denotes the distance matrix of graphs and Tr(G) is the diagonal matrix of vertex transmissions of G. For a given minimum dominating se...

متن کامل

The Existence Theorem for Contractive Mappings on $wt$-distance in $b$-metric Spaces Endowed with a Graph and its Application

In this paper, we study the existence and uniqueness of fixed points for mappings with respect to a $wt$-distance in $b$-metric spaces endowed with a graph. Our results are significant, since we replace the condition of continuity of mapping with the condition of orbitally $G$-continuity of mapping and we consider $b$-metric spaces with graph instead of $b$-metric spaces, under which can be gen...

متن کامل

On the distance eigenvalues of Cayley graphs

In this paper, we determine the distance matrix and its characteristic polynomial of a Cayley graph over a group G in terms of irreducible representations of G. We give exact formulas for n-prisms, hexagonal torus network and cubic Cayley graphs over abelian groups. We construct an innite family of distance integral Cayley graphs. Also we prove that a nite abelian group G admits a connected...

متن کامل

A UNIFIED MODEL FOR RESOURCE-CONSTRAINED PROJECT SCHEDULING PROBLEM WITH UNCERTAIN ACTIVITY DURATIONS

In this paper we present a unified (probabilistic/possibilistic) model for resource-constrained project scheduling problem (RCPSP) with uncertain activity durations and a concept of a heuristic approach connected to the theoretical model. It is shown that the uncertainty management can be built into any heuristic algorithm developed to solve RCPSP with deterministic activity durations. The esse...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011